Real Time Metagenomics: Using k-mers to annotate metagenomes

Authors

  • Robert A. Edwards
  • Robert Olson
  • Terry Disz
  • Gordon D. Pusch
  • Veronika Vonstein
  • Rick L. Stevens
  • Ross A. Overbeek
Abstract

Annotation of metagenomes involves comparing the individual sequence reads with a database of known sequences and assigning a unique function to each read. This task is time-consuming and computationally intensive (though not computationally complex). Here we present a novel approach to annotating metagenomes using unique k-mer oligopeptide sequences 7 to 12 amino acids long. We demonstrate that k-mer-based annotations are faster than blastx-based annotations and approach their sensitivity and precision without losing accuracy. A last-common ancestor approach was also developed to describe the members of the community.
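The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`build_signature_index`, `annotate`, `last_common_ancestor`), the default `K = 8`, and the `min_hits` threshold are all hypothetical, and the input is assumed to be a read that has already been translated into amino acids. The sketch keeps only k-mers that occur in a single function, assigns a read the function with the most k-mer hits, and summarizes a set of taxonomic lineages by their shared prefix.

```python
from collections import Counter

K = 8  # hypothetical k-mer length; the abstract uses 7 to 12 amino acids


def build_signature_index(proteins, k=K):
    """Map each k-mer seen in exactly one function to that function.

    `proteins` is an iterable of (amino_acid_sequence, function) pairs.
    """
    seen = {}
    for seq, function in proteins:
        for i in range(len(seq) - k + 1):
            seen.setdefault(seq[i:i + k], set()).add(function)
    # Keep only k-mers that are unique to a single function.
    return {kmer: next(iter(funcs))
            for kmer, funcs in seen.items() if len(funcs) == 1}


def annotate(peptide, index, k=K, min_hits=2):
    """Return the function with the most k-mer hits, or None below min_hits."""
    hits = Counter()
    for i in range(len(peptide) - k + 1):
        function = index.get(peptide[i:i + k])
        if function is not None:
            hits[function] += 1
    if not hits:
        return None
    function, count = hits.most_common(1)[0]
    return function if count >= min_hits else None


def last_common_ancestor(lineages):
    """Return the longest shared prefix of root-to-leaf taxonomic lineages."""
    shared = []
    for ranks in zip(*lineages):
        if len(set(ranks)) != 1:
            break
        shared.append(ranks[0])
    return shared
```

Each read costs one dictionary lookup per k-mer, which is why an exact-match approach of this kind can be much cheaper than an alignment search; the `min_hits` cutoff is one simple way to trade sensitivity for precision.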


Similar resources

A Concurrent Subtractive Assembly Approach for Identification of Disease Associated Sub-metagenomes

Comparative analysis of metagenomes can be used to detect sub-metagenomes (species or gene sets) that are associated with specific phenotypes (e.g., host status). The typical workflow is to assemble and annotate metagenomic datasets individually or as a whole, followed by statistical tests to identify differentially abundant species/genes. We previously developed subtractive assembly (SA), a de...


Real-Time Metagenomics

In the last few years a new technology called metagenomics has revolutionized biology. This technique allows biologists to sequence the DNA (genetic makeup) of all the organisms in an environment. The Real-Time Metagenomics project provides biologists with a variety of tools to annotate metagenomes using web 2.0 technology, including web services (RTMg.web), Google’s Android cell phone operatin...


SKraken: Fast and Sensitive Classification of Short Metagenomic Reads based on Filtering Uninformative k-mers

The study of microbial communities is an emerging field that is revolutionizing many disciplines from ecology to medicine. The major problem when analyzing a metagenomic sample is to taxonomically annotate its reads in order to identify the species in the sample and their relative abundance. Many tools have been developed in recent years; however, the performance in terms of precision and speed ...


Evaluation of methods to concentrate and purify ocean virus communities through comparative, replicated metagenomics

Viruses have global impact through mortality, nutrient cycling and horizontal gene transfer, yet their study is limited by complex methodologies with little validation. Here, we use triplicate metagenomes to compare common aquatic viral concentration and purification methods across four combinations as follows: (i) tangential flow filtration (TFF) and DNase + CsCl, (ii) FeCl3 precipitation and ...


Fast and sensitive taxonomic classification for metagenomics with Kaiju

Metagenomics emerged as an important field of research not only in microbial ecology but also for human health and disease, and metagenomic studies are performed on increasingly larger scales. While recent taxonomic classification programs achieve high speed by comparing genomic k-mers, they often lack sensitivity for overcoming evolutionary divergence, so that large fractions of the metagenomi...



Journal title:

Volume 28, Issue

Pages -

Publication date: 2012